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LAYER-THICKNESS DETECTION METHODS AND 
APPARATUS FOR WAFERS AND THE LIKE, AND 
POLISHING APPARATUS COMPRISING SAME 



5 Field of the Invention 

The present invention relates to any of various methods for 
planarizing a surface of a workpiece such as a semiconductor wafer 
during manufacturing semiconductor devices on the wafer. A 
representative such method is "chemical mechanical polishing" ("CMP") 
1 0 as used to planarize semiconductor wafers between certain 

manufacturing steps. More specifically, the invention pertains to 
methods and apparatus for detecting a "process endpoint" (i.e., time to 
stop planarizing). 



1 5 Background of the Invention 

Semiconductor devices (e.g., integrated circuits, displays, and the 
like) are becoming increasingly dense and highly integrated. With this 
trend, certain processes such as processes directed to formation of 
multiple wiring layers, formation of insulating films between wiring 

20 layers, formation of inter-layer connecting plugs and the like, and 

formation of electrodes are becoming increasingly more critical. For 
example, controlling the thickness and depth profiles of inter-layer 
insulating films or metal layers is now very important to ensure 
attainment of target integration levels in devices with high reliability. 

25 Achieving such control requires that layer thicknesses in steps such as 
layer-formation steps and etching steps be monitored. 

Responsive to the need to achieve ever-increasing device density 
along with ever decreasing feature sizes, the microlithography industry 
has developed microlithography apparatus that utilize extremely short 

30 wavelengths of light, such as deep-UV light, but at large numerical 

apertures. Projection optics used in such apparatus have extremely short 
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focal ranges. The focal ranges are now so short that the uneven surface 
that results from stacking multiple layers atop one another during 
manufacture of an integrated circuit are no longer in sharp focus from 
peaks to valleys of the surface. Consequently, it has become 
5 increasingly important to accurately planarize the surface of the wafer (at 
least within exposure areas) between certain layer-formation steps. It is 
also important to perform a planarization step after embedding an inlay of 
a metal electrode layer to form inter-layer connecting plugs and the like. 
Planarization typically involves the removal of material from the 

1 0 surface of the wafer. Whereas several candidate techniques have now 
been developed for performing planarization, as summarized below, a key 
problem has been how to accurately detect, during planarization, when 
to stop planarizing so as to ensure the desired amount of material has 
been removed without removing excessive material. 

1 5 Among the several conventional planarization methods, a polishing 

process termed "chemical mechanical polishing" or "chemical mechanical 
planarization" (abbreviated "CMP") has received considerable favor. This 
is because, inter alia , CMP is effective for planarizing wafers having a 
large surface area and is effective for planarizing microscopic bumps and 

20 other surficial irregularities from wafer surfaces. CMP achieves such 

results from a combination of mechanical abrasion (using an abrasive in 
liquid suspension) and chemical action (using a mild surface-eroding 
chemical in the liquid suspension). More specifically, during CMP, the 
wafer surface is urged against a polishing pad, saturated with a polishing 

25 slurry, as the wafer and polishing pad undergo motion relative to each 
other. The polishing slurry is a suspension of polishing granules (silica, 
alumina, cerium oxide, or the like, depending on the material on the 
surface of the wafer) in an acidic or basic (depending on the material on 
the wafer surface) carrier liquid. With CMP, the entire surface of the 

30 wafer can be polished uniformly by making sure that the applied 
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polishing pressure, amount of slurry used, and velocity of relative 
motions are uniform over the entire wafer surface. 

Unfortunately, achieving consistent results with CMP is much 
more difficult than with other semiconductor processing steps such as 
5 layer-forming and etching. Therefore, there is a great need for improved 
methods for monitoring the thickness of the layer(s) on a wafer being 
polished by CMP, especially such methods that can provide quick, 
accurate, and efficient feedback to the CMP apparatus. 

According to one conventional approach for monitoring polishing, 

10 changes in friction between the wafer and the polishing pad are 

monitored as corresponding changes in the torque being applied by a 
motor used to effect rotation of the wafer or polishing pad. For example, 
a change in torque is encountered when polishing has progressed to an 
underlying layer made of a material having a substantially different 

1 5 coefficient of friction than the layer being polished away. Unfortunately, 
monitoring polishing by monitoring torque is notoriously inaccurate and 
unreliable. 

Optical methods for monitoring polishing offer prospects for high 
accuracy. According to one conventional method, a small wafer "blank" 

20 region (i.e., a location on a wafer where the surficial layer is essentially 
planar and desirably not patterned) is subjected to the same CMP process 
as the remainder of the wafer and used as a measurement sample. 
Measuring the thickness of the surficial layer as polishing progresses is 
performed by monitoring changes in the blank region. 

25 Unfortunately, the "blanks-measurement method has several 

serious disadvantages. First, the method requires considerable time to 
execute and to provide feedback to the actual polishing process. Wafers 
are normally imprinted with as many devices as possible placed side by 
side. A non-imprinted portion, for use as a blank, of the wafer must be 

30 located among the devices on the wafer. The area of such a blank is 
normally very small and, on some wafers, has an unspecified location. 
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Because the size of the blank region is normally very small, the range of 
measurements that can be performed at the location is also very small. 
Available apparatus for accurately measuring layer thickness within such 
a small area are simply not available. It is also very difficult to perform 
5 measurements in such a small area at sufficiently high speed because the 
required mechanism for picking up, recognizing, and processing the 
image of the blank region is very complex. 

Also, positioning the wafer for measurement at the blank location 
poses many problems. Because the blank location is normally very small 

10 (and sometimes not even specified), accurate alignment mechanisms are 
required to ensure that the measurements are consistently performed at 
the blank location. 

Another substantial contributor to the excessive amount of time 
required to perform the "blank" technique is the need to interrupt 

1 5 polishing, clean the wafer, and transport the wafer to a remote but stable 
location for measurement. 

In another conventional optical method for monitoring polishing, 
the thickness of the layer being polished is monitored by optical 
interference. In such a technique, an optical path is provided through the 

20 polishing pad to the wafer surface being polished, and a laser light beam 
is directed through the polishing pad to the wafer surface during 
polishing. Alternatively, light (e.g., infrared light) is transmitted through 
the wafer holder and through the wafer from the rear surface of the 
wafer to the surface being polished. Temporal changes in the intensity 

25 of light reflected from the surface being polished are monitored as 

polishing progresses, and a polishing endpoint is believed to be reached 
when the intensity of the reflected laser light ceases to change with 
further polishing. Use of such a method for measuring the thickness of a 
layer at a "blank" location is normally effective in achieving a satisfactory 

30 accuracy. However, requisite accuracy is not obtained whenever the 
method is used to detect a polishing endpoint for a patterned layer 
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(which must be performed when attempting to monitor polishing in real 
time as polishing progresses). This problem is even more pronounced 
when the wafer surface is patterned with logic circuits or a combination 
of logic and memory circuits. 
5 Moreover, because the wafer undergoes motion during polishing, 

signals from a specific (blank) location on the wafer are not normally 
obtainable, either directly or extracted from signals from other portions of 
the wafer surface. This method is also susceptible to obfuscating 
effects of signal noise, making determinations of polishing endpoint even 

10 more difficult to obtain. Consequently, polishing endpoint detection by 
this technique as conventionally applied cannot be reliably realized, 
especially in real time. 

According to yet another conventional optical approach, the layer 
thickness is monitored by illuminating onto the wafer surface a light 

15 beam having a relatively large spot diameter. Optical measurements of 
light reflecting from the wafer surface are made and averaged. 
Unfortunately, if the wafer is imprinted with device patterns, the 
waveform of reflected light is usually extremely complex since the signal 
depends not only upon the layer thickness but also on the device 

20 patterns. Therefore, the thickness of a layer being polished cannot be 
measured reliably or accurately using this method. 

Other methods that have received some attention include 
detecting changes in vibration or sounds, or detecting changes in slurry 
composition that purportedly occur whenever a polishing endpoint is 

25 being approached. None of these techniques has proved to be effective 
or reliable. 

As can be ascertained from the foregoing summary, none of the 
conventional techniques for monitoring the thickness of a layer during 
polishing is satisfactory for current requirements. Each technique as 
30 conventionally applied suffers from one or more deficiencies such as, 

inter alia , excessive time required for performing the measurements and 
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providing feedback to the polishing process, inaccuracy of determinations 
of layer thickness, lack of repeatability, and/or inapplicability to certain 
types of layers or to certain steps during manufacture of a 
semiconductor device. 

5 

Summary of the Invention 

In view of the above-summarize shortcomings of the prior art, an 
object of the invention to provide, inter alia, simple and convenient 
detection methods by which the thickness of one or more layers on the 
10 surface of a workpiece (e.g., wafer) can be determined. Such 

determinations are especially useful in the determination of a process 
endpoint. 

According to a first aspect of the invention, methods are provided 
for determining a thickness of a surficial thin-film layer on a substrate as 

1 5 the thin-film layer is being subjected to a process resulting in a change in 
thickness of the thin-film layer. According to a representative 
embodiment of such a method, a probe light is illuminated onto a region 
of a surface of the thin-film layer to produce a signal light propagating 
from the thin-film layer. The signal light is detected. From the detected 

20 signal light, ^spectral characteristic of the signal light is measured to 
produce a spectral-characteristic signal. A value of a parameter of the 
spectral-characteristic signal is calculated, wherein the value is a function 
of the thickness of the thin-film layer. From the calculated value of the 
parameter, the thickness of the thin-film layer is determined. Such a 

25 process can also include the step of determining, from the calculated 
value of the parameter, an endpoint at which to terminate the process 
and thus cease changing the thickness of the thin-film layer. An 
exemplary and especially useful spectral characteristic is spectral 
reflectance (R(?J). 

30 Although such a method can be applied to any of various 

workpieces bearing one or more surficial thin layers, the method is 
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especially applicable to semiconductor wafers having one or more 
surficial thin-film layers including, for example, one or more electrode 
layers, metal layers, and insulator layers. 

Any of various parameters of the spectral characteristic can be 
5 used for determining the thickness of a layer. For example, the 

parameter can be a local maximum of the spectral-characteristic signal 
(wherein a "local maximum" is a peak appearing in a plot of the spectral 
characteristic), a local minimum of the spectral-characteristic signal 
(wherein a "local minimum" is a valley appearing in a plot of the spectral 

10 characteristic), a difference of a selected local minimum from a selected 
local maximum, or a quotient of a selected local minimum to a selected 
local maximum. Other possible parameters include a largest local 
maximum of the spectral-characteristic signal, a smallest local minimum 
of the spectral-characteristic signal, a difference of the largest local 

1 5 minimum from the largest local maximum, and a quotient of the smallest 
local minimum to the largest local maximum. Yet other parameters 
include a spectral dispersion of the spectral-characteristic signal (wherein 
a "spectral dispersion" is a variance of the spectral characteristic) and a 
component of a Fourier transform of the spectral-characteristic signal. 

20 The signal light that is detected can be probe light reflected from 

the surface of the workpiece or transmitted through the workpiece. 

As summarized above, various conventional methods for optically 
measuring the thickness of a thin film are known. In conventional 
methods employing interference phenomena, the requisite high accuracy 

25 is achieved only when the methods are used to measure the thickness of 
a blank film (i.e., a uniformly flat film). Methods according to the present 
invention (such as the method summarized above), in contrast, can be 
utilized for measuring not only the thickness of a blank film but also the 
thickness of, e.g., a layer on a wafer having a device pattern (substrate 

30 pattern) that is two-dimensionally non-uniform. In such an application 
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instance, the obtained signals differ substantially from the signals 
obtainable from a blank film. 

In methods according to the invention, such as those summarized 
above, signal light that is reflected from or transmitted through the 
5 workpiece are processed to obtain a measure of a parameter that 
desirably rapidly changes when the process is at or near a process 
endpoint. 

According to another aspect of the invention, in a process for 
progressively reducing a thickness of a thin-film layer on a surface of a 

10 substrate, methods are provided for detecting a process endpoint 
representing a minimum desired thickness of the thin-film layer. 
According to a representative embodiment of such a method, a probe 
light is directed onto a region of a surface of the thin-film layer to 
produce a signal light propagating from the thin-film layer. The signal 

1 5 light is detected and a spectral characteristic of the signal light is 

measured from the detected signal light so as to produce a spectral- 
characteristic signal. A cross-correlation function is calculated. The 
cross-correlation function is of the spectral-characteristic signal with a 
predetermined reference spectral-characteristic signal- The cross- 

20 correlation function exhibits a change with a corresponding change in the 
thickness of the thin-film layer. From the cross-correlation function, the 
process endpoint is determined. 

According to yet another aspect of the invention, apparatus are 
provided for determining a process endpoint of a process for reducing a 

25 thickness of a thin-film layer on a substrate. A representative 

embodiment of such an apparatus comprises a source of a probe light, a 
probe-light optical system, a detector, a signal-light optical system, and a 
signal processor. The probe-light optical system directs the probe light 
to a location on a surface of the thin-film layer so as to produce a signal 

30 light propagating from the location. The detector detects the signal light, 
and the signal-light optical system directs the signal light from the 
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location to the detector. The signal processor, which is connected to the 
detector, measures a spectral characteristic of the signal light from the 
detected signal light, calculates a parameter of the spectral characteristic 
that is a function of the thickness of the thin-film layer; and determines 
5 the thickness of the thin-film layer from the calculated parameter. 

According to yet another aspect of the invention, apparatus are 
provided for planarizing a surface of a workpiece. A representative 
embodiment of such an apparatus comprises a polishing pad and a 
polishing head. The polishing head is configured to support the 

10 workpiece and contact the workpiece against the polishing pad. A 

mechanism is provided to move the polishing pad and the polishing head 
relative to each other as the workpiece contacts the polishing pad for 
polishing the workpiece. This apparatus also includes a device for 
determining a process endpoint as summarized in the preceding 

1 5 paragraph, for example. 

According to yet another aspect of the invention, processes are 
provided for detecting, for example while reducing a thickness of a thin- 
film layer on a surface of a workpiece, the thickness of the thin-film 
layer. According to a representative embodiment of such a process, a 

20 probe light is directed to a location on the thin-film layer so as to produce 
a signal light propagating from the location. A signal waveform is 
produced from the signal light, and a value of a parameter of the signal 
waveform is calculated. From such a value, a thickness of the thin-film 
layer is detected. The parameter can be, for example, a difference 

25 between a largest local maximum of the signal waveform and a smallest 
local minimum of the signal waveform. Other exemplary parameters are 
the smallest local minimum of the signal waveform, a quotient of the 
smallest local minimum of the signal waveform to the largest local 
maximum of the signal waveform, and the mean of the signal waveform. 

30 An example mean is of spectral reflectance: s(R(x,))/n, wherein n is the 
sample size. 
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The method summarized in the preceding paragraph solves the 
problem of optically measuring a process endpoint with a satisfactory 
degree of accuracy in the case in which a device pattern exists on the 
wafer surface. It has been discovered that signal light obtained by 
5 illuminating a probe light onto the wafer surface is a superposition of a 
pattern interference component and a film-thickness interference 
component. It has also been discovered that the magnitude of the 
pattern-interference component changes according to changes in the 
device pattern on the wafer. In other words, since the magnitude of the 

10 pattern-interference component changes in accordance with changes in 
the device pattern on the wafer, the signal light contains a degree of 
uncertainty that corresponds to the device pattern (i.e., the type of 
device corresponding to the pattern). For example, there is a significant 
difference in the degree of uncertainty between a DRAM that is a part of 

1 5 a memory, part of a logic device, and part of a device having both a logic 
component and a memory component. This uncertainty arises because 
the pattern-interference components of these devices differ significantly 
from each other due to differences in the degree of integration, even if 
the devices are of the same type. 

20 Another type of uncertainty by which the magnitude of the 

pattern-interference component differs is the location on the pattern at 
which measurement is performed, even among devices of the same type 
and having the same degree of integration. In a memory device such as 
a DRAM, the degree of uncertainty of this type is small since the pattern 

25 on the memory device can be regarded as a uniform continuation of its 
periodic structure. However, the pattern on a logic device or a device 
having a logic component and a memory component is not uniform. 
Hence, in such a device, the degree of this type of uncertainty is high. It 
has been discovered that such types of uncertainties are major causes of 

30 large errors encountered in the detection of process endpoints and the 
like in the prior art. According to the method according to the invention 
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summarized above, the causes of such uncertainty are alleviated by 
specifying a position for measurements of the device pattern based on a 
suitable parameter obtained from the signal-light waveform. 

The method summarized above can further comprise the step of 
5 providing a reference value of the parameter corresponding to a reference 
thickness of the thin-film layer. The actual measured thickness of thin- 
film layer can be compared with the reference value to obtain a 
comparison value. From the comparison value, a process endpoint can 
be calculated, wherein the process endpoint represents a point at which 

10 to cease reducing the thickness of the thin-film layer. The method can 
also further comprise the step of specifying on the thin-film layer a 
measurement position that includes the location. In such a method, the 
measurement of the value of the parameter is performed at the 
measurement position. A reference value of the parameter, 

1 5 corresponding to a reference thickness of the thin-film layer at the 

measurement position, can be provided. In such an instance, the actual 
measured thickness of the thin-film layer can be compared with the 
reference value to obtain a comparison value. From the comparison 
value, a process endpoint can be calculated at which to cease reducing 

20 the thickness of the thin-film layer. 

According to a further modification of the method, an optical 
signal (e.g., a reflectance signal) is obtained from a desired measurement 
position on the surface of the workpiece. From the optical signal, a 
thickness of the thin-film layer is calculated. For example, thickness can 

25 be algebraically calculated from one or more local maxima in a plot of the 
optical signal, or by fitting a measured waveform to a pre-calculated 
waveform. 

The calculated thickness is compared with a reference thickness at 
the measurement position so as to determine a process endpoint at 
30 which to cease reducing the thickness of the thin-film layer. 
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According to yet another aspect of the invention, methods are 
provided, for use in processes for reducing the thickness of a thin-film 
layer on an integrated circuit device formed on a surface of a 
semiconductor wafer, for detecting the thickness of the thin-film layer. 
5 According to a representative embodiment of such a method, a probe 
light is directed to a location on the thin-film layer so as to produce a 
signal light propagating from the location. The signal light is produced 
either by reflection of probe light from the thin-film layer or transmission 
of probe light through the thin-film layer. All orders of diffracted light are 

1 0 removed from the signal light except a zeroth order of diffracted light 
from which a signal waveform is produced. A value of a parameter of 
the signal waveform is calculated, from which value a thickness of the 
thin-film layer is calculated. The higher orders (i.e., greater than zeroth 
order) of diffracted light are removed by passing the signal light from the 

1 5 location through an aperture (which can be variable-sized) defined by an 
aperture plate, wherein the aperture plate blocks the higher orders of 
diffracted light. Alternatively, a two-dimensionally distributed 
measurement of a spot pattern of the signal light can be provided while 
blocking the higher orders of signal light. (A two-dimensional spot 

20 pattern can be obtained by, e.g., detecting signal light using a sensor 
comprising two-dimensionally distributed sensor elements, such as a 
CCD panel.) 

With respect to the method summarized above, a conventional 
cause of difficulty encountered while attempting to detect the thickness 

25 of a film on the device pattern on a wafer with satisfactory accuracy is 
addressed. It has been discovered that reflected signal light obtained by 
illuminating a probe light onto the device pattern exhibits many 
diffraction spots caused by a diffraction of light reflecting from a device 
pattern having regular fine features (wherein diffraction spots are 

30 generated from a two-dimensionally distributed pattern, in comparison to 
diffraction fringes generated from a one-dimensionally distributed 
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pattern). Each of the diffraction spots changes, in a different manner 
depending on the film thickness, for each order of diffraction. Thus, the 
obfuscating influence of higher diffraction orders is eliminated. 

According to yet another aspect of the invention, apparatus are 
5 provided for determining a process endpoint of a process for reducing a 
thickness of a thin-film layer on a substrate. A representative 
embodiment of such an apparatus comprises a source of probe light, a 
probe-light optical system, a detector, a signal-light optical system, an 
aperture-defining plate, and a signal processor. The probe-light optical 

10 system directs the probe light to a location on a surface of the thin-film 
layer so as to produce a signal light propagating from the location. The 
detector detects the signal light that passes from the location through 
the signal-light optical system to the detector. The aperture plate is 
situated in the signal-light optical system, wherein the aperture (which 

1 5 can be variable) functions to remove all orders of diffracted light from the 
signal light except zero-order reflected light. The signal processor, to 
which the detector is connected, measures a spectral characteristic of 
the signal light from the detected signal light. The signal processor then 
calculates a parameter of the spectral characteristic that is a function of 

20 the thickness of the thin-film layer, and determines the thickness of the 
thin-film layer from the calculated parameter. This apparatus can be 
incorporated into an apparatus for planarizing the surface of a workpiece 
such as a wafer. 

According to yet another aspect of the invention, methods are 

25 provided for measuring the thickness of at least one of an insulating layer 
and a metal electrode layer on a surface of a semiconductor device 
undergoing a process in which the layer (imprinted with a pattern) is 
being reduced in thickness (such as CMP). According to a representative 
embodiment of such a method, a probe light is illuminated onto at least a 

30 portion of a surface of the layer on the wafer so as to produce a signal 
light (e.g., an optical interference pattern) propagating from the layer. 
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The signal light is detected, an intensity profile of the signal light is 
measured (desirably relative to wavelength), and a spatial coherence 
length of the signal light is determined. The spatial coherence length of 
the signal light is compared with a degree of fineness of the pattern 
5 illuminated by the probe light, and an optical model is determined based 
on the comparison. (The optical model is essentially a simulated 
distribution of features on the surface of the device, based on the 
particular waveform of signal light propagating from the actual surface of 
the device. The optical model is generated under the premise that it 

10 produces the same waveform of signal light as an actual corresponding 
device under the same incidence conditions of probe light. That is, signal 
light produced by the actual device and signal light produced by a surface 
according to the optical model would be substantially the same if the 
incidence conditions are the same.) 

1 5 Based on the optical model, a theoretical intensity profile of the 

signal light is determined. The thickness of the layer and/or a process 
endpoint is determined by comparing the measured intensity profile of 
the signal light with the theoretical intensity profile of signal light. In 
such a method, the spatial coherence length of the probe light can be 

20 varied, desirably according to the degree of fineness of the pattern. The 
calculated theoretical intensity profile of signal light can be stored for 
later recall. The theoretical intensity profile can be calculated for a 
thickness of multiple films having an inter-film distance therebetween, 
wherein the comparison is made based on a similarity between the 

25 calculated theoretical intensity profile of signal light and the measured 
change in the signal-light intensity profile. Alternatively, a cross- 
correlation coefficient of the theoretical intensity profile of signal light 
and the measured intensity profile of signal light can be calculated. In 
the latter instance, the comparison can be made based on a similarity 

30 between a cross-correlation coefficient of a Fourier transform of the 
theoretical intensity profile of signal light and the measured intensity 
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profile of signal light, and a position and magnitude of a Fourier 
component of the calculated theoretical intensity profile of signal light 
and a position and magnitude of a Fourier component of the measured 
intensity profile of signal light. 
5 According to yet another aspect of the invention, apparatus are 

provided (in the context of an apparatus for planarizing a surface on a 
semiconductor wafer imprinted with a semiconductor device) for 
measuring a thickness of a layer on a surface of the semiconductor 
device imprinted on the wafer so as to provide a planarizing process 

10 endpoint. According to a representative embodiment, the apparatus 
comprises an illumination system, a measuring system, a numerical 
calculation system, and a detection system. The illumination system 
illuminates a probe light onto a portion of the surface of the layer on the 
wafer so as to produce a signal light propagating from the surface. The 

1 5 measuring system measures a change in an intensity of the signal light. 
The numerical calculation system is connected to the measuring system 
and calculates a theoretical intensity profile of signal light based on an 
optical model. The optical model is based on a comparison of a spatial 
coherence length of the probe light with a degree of fineness of a pattern 

20 for the semiconductor device illuminated with the probe light. The 
detection system detects at least one of a layer thickness and the 
process endpoint by comparing the measured intensity profile of signal 
light with the calculated theoretical intensity profile of signal light. A 
controller can be connected to the numerical calculation system and 

25 employed to control the spatial coherence length of the probe light. The 
controller can comprise a storage system (e.g., memory) that stores data 
concerning the calculated theoretical intensity profile of signal light. The 
detection system performs the comparison using a cross-correlation 
coefficient of the calculated theoretical intensity profile of the signal light 

30 and a measured intensity profile of the signal light. The detection system 
also performs a similarity comparison using a cross-correlation coefficient 
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of a Fourier transform of the calculated theoretical intensity profile of the 
signal light and the measured intensity profile of the signal light, and/or a 
cross-correlation coefficient of a position and magnitude of a Fourier 
component of the calculated theoretical intensity profile of the signal 
5 light and a position and magnitude of a Fourier component of the 
measured intensity profile of the signal light. 

The foregoing and other features and advantages of the invention 
will be more readily apparent from the following detailed description, 
which proceeds with reference to the accompanying drawings. 

10 

Brief Description of the Drawings 

FIG. 1 shows a probe light P incident on a target surface to 
produce rays of signal light reflected from the target surface as well as 
various depths below the target surface. 
1 5 FIG. 2 shows certain aspects of the measurement apparatus used 

in Example 1 . 

FIG. 3 shows certain details of the measuring optical system used 
in the FIG. -2 apparatus. 

FIG. 4(a) is a plot of a waveform-dependent profile of the spectral 
20 reflectance of a wafer in a pre-polish state as described in Example 1 . 

FIG. 4(b) is a plot of the waveform-dependent profile of the 
spectral reflectance of a wafer in a post-polish state as described in 
Example 1 . 

FIG. 5 is a plot, with respect to Example 2, of the change in the 
25 measurement parameter (global maximum spectral reflectance/global 
minimum spectral reflectance) with respect to layer thickness as 
polishing progressed. The polishing endpoint is indicated with an arrow. 

FIG. 6 is a plot, with respect to Example 2, of the change in 
intensity of total reflection with respect to layer thickness as polishing of 
30 the layer progressed. The polishing endpoint is indicated with an arrow. 
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FIG. 7 is a plot, with respect to Example 3, of the change in 
dispersion of the spectral reflectance with respect to layer thickness as 
polishing of the layer progressed. The polishing endpoint is indicated 
with an arrow. 

5 FIG. 8 is a plot, with respect to Example 4, of a suitable 

wavenumber component of a Fourier transformation of the spectral 
reflectance, with respect to layer thickness as polishing of the layer 
progressed. The polishing endpoint is indicated with an arrow. 

FIG. 9(a) is a plot, with respect to the Fifth Embodiment, of 
10 reflectance as a function of wavelength of signal light produced by 

reflection of probe light from a wafer surface imprinted with a pattern 
having an extremely fine pitch, such as that of memory cells. 

FIG. 9(b) is a plot, with respect to the Fifth Embodiment, of 
reflectance as a function of wavelength of signal light produced by 
1 5 reflection of probe light from a wafer surface imprinted with a pattern 
having a coarse pattern pitch, such as that of a wiring layer. 

FIG. 10 is a schematic plan view, with respect to the Sixth 
Embodiment and Example 6, of various regions of a representative device 
pattern, such a pattern for a microprocessor. 
20 FIG. 1 1 (a) includes plots, with respect to Example 6, of 

reflectance versus wavelength obtained from regions A, B, and C, 
respectively, of FIG. 10 which have fine features. 

FIG. 1 1(b) includes plots, with respect to Example 6, of 
reflectance versus wavelength obtained from regions D, E, and F, 
25 respectively, of FIG. 10 which have coarser features than regions A, B, 
and C. 

FIG. 12 depicts, with respect to the Seventh Embodiment, the 
production of diffraction spots in signal light reflected from a wafer 
surface. The pattern on the wafer has a feature pitch "d", wherein the 
30 nth diffraction spot is a function of the pitch of the pattern and the 
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wavelength x oi the probe light in the direction of the reflection direction 
9- 

FIG. 13 depicts an apparatus according to the Seventh 
Embodiment. 

5 FIG. 14(a) includes plots, with respect to Example 10, of 

measured and calculated values of spectral reflectance of a wafer (zeroth 
order diffracted signal light only) at start of polishing. 

FIG. 14(b) includes plots, with respect to Example 10, of 
measured and calculated values of spectral reflectance of a wafer (zeroth 
10 order diffracted signal light only) at the polishing endpoint. 

FIG. 15 depicts a measurement optical system according to the 
Eighth Embodiment. 

FIG. 16 is a schematic depiction of the manner in which the probe 
light illuminates multiple features of a pattern on the wafer surface in the 
1 5 Eighth Embodiment. 

FIG. 17 shows a magnified vertical section of a portion of the 
pattern structure on the wafer surface, illustrating the principle by which 
the reflected signal light from individual features is superposed in the 
Eighth Embodiment. 

20 

Detailed Description 

The present invention is described below in the context of multiple 
representative embodiments and examples. However, it will be 
understood that the scope of the invention is not limited to these 

25 embodiments. 

For performing optical measurements according to the invention, a 
light source is desirably used that produces a continuous illumination 
light ("probe light") consisting of multiple wavelengths. The probe light, 
desirably in the form of a beam of light, is directed to the surface 

30 ("target surface") of a workpiece undergoing polishing or other 
processing step. 
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Probe light reflected from the target surface or transmitted through 
the target surface (i.e., "signal light") has a wavelength-dependent profile 
of any of various characteristics ("spectral characteristics") relative to 
the incident probe light that is characteristic of the surface. For example, 
5 a particularly useful spectral characteristic is wavelength-dependent 

reflectance {i.e., spectral reflectance). The spectral characteristics useful 
in the present invention can be detected directly and/or calculated from 
data provided by a suitable detector. An electrical signal resulting from 
such detection and/or data processing can be subjected to signal 
1 0 processing to extract data concerning the target surface, including the 
thickness and patterning of one or more thin-film layers on the target 
surface. 

The probe light is not limited to visible light. For example, for 
probe light illuminated onto the rear side of the workpiece (to allow 

1 5 detection of signal light produced by transmission of probe light through 
the workpiece), infrared light may be most effective. 

The spot diameter of probe light incident on the target surface is 
desirably larger than the minimum feature size of any pattern imprinted 
on the target surface. This desirably allows at least several periods of 

20 the incident probe light to be illuminated on the target surface. 

Typically, the target surface of a semiconductor wafer or 
analogous workpiece comprises a laminate of multiple thin-film layers 
each defining a respective pattern extending in two dimensions. Each 
layer typically has a respective pattern. The patterns are typically 

25 interconnected with each other in three dimensions. As shown 

schematically in FIG. 1 , a probe light P is incident on a target surface to 
produce rays of signal light 1, 3, 5, 7 reflected from the target surface. 
The signal light comprises light reflected from various surfaces 2, 4, 6, 
8. For example, some of the signal light 1 is produced by reflection of 

30 probe light P from the surface 2 of an outermost layer on the target 

surface. Other signal light 3, 5, 7 is produced by reflection of probe light 
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from surfaces 4, 6, 8, respectively, of one or more layers situated 
beneath the outermost layer. Such reflection from multiple layers results 
in a signal light having a complex interference pattern. A spectral 
characteristic (e.g., spectral reflectance) of signal light reflected from a 
5 target surface as shown in FIG. 1 usually differs greatly from a similar 
spectral characteristic of signal light reflected from a "blank" target 
surface. (A "blank" target surface is a planar surface. It can be 
patterned but is desirably not patterned). 

Calculating the thickness of a patterned layer on the target surface 

10 directly from the spectral characteristic of signal light reflected from such 
a target surface is difficult. In such calculations, the spectral 
characteristic of signal light reflected from a patterned layer of a 
prescribed thickness is measured. The pre-determined spectral 
characteristic is compared to a similar spectral characteristic of signal 

1 5 light reflected from such a layer undergoing a change such as polishing. 
Such methods require execution of a complicated and time-consuming 
procedure of inputting the necessary parameters for performing the 
necessary calculations. Such procedures are also prone to introduction 
of errors, resulting in low measurement accuracy. 

20 Methods and apparatus according to the invention can be applied 

to measurements of signal light reflected from the target surface as well 
as of signal light transmitted through the target surface. The various 
embodiments described herein introduce substantial simplifications to 
prior-art methods and apparatus and provide substantial improvements in 

25 measurement accuracy. 

For simplicity, much of the following discussion of the various 
embodiments is directed to measurements of reflected signal light. 
However, it will be understood that the principles of the invention can be 
applied with equal facility to measurements of transmitted signal light. 

30 In the various embodiments, a probe light comprising multiple 

wavelengths is illuminated onto a target surface of a workpiece. For 
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simplicity, the workpiece is generally referred to herein as a "wafer." 
However, it will be understood that the workpiece can be any of various 
objects that can be subjected to a polishing operation or analogous 
process in which monitoring of changes to the surface thereof is 
5 desirable. 

In certain embodiments of the invention, of the probe light incident 
on the target surface, only the zeroth-order of the reflected signal light is 
detected. Certain spectral characteristics (e.g., spectral reflectance) of 
zeroth-order signal light from the target surface changes in a complex 

10 manner, due to interference, with respect to the wavelength of the probe 
light. However, in plots of such spectral characteristics (e.g., 
reflectance) versus wavelength of reflected zeroth-order light, notable 
maxima and minima are exhibited. 

To a first approximation, the spectral reflectance of reflected 

1 5 zeroth-order signal light from the target surface comprises multiple 

interference components generated by probe light passing through the 
thickness dimension of the uppermost layer on the target surface. Other 
interference components are contributed by probe light scattered by 
bumps in the uppermost layer, and by probe light reflected from pattern 

20 features in underlying layers. 

For maximal accuracy when monitoring changes to a target 
surface (such as during polishing of the target surface), it is desirable 
that the selected spectral characteristic change rapidly over the time 
during which the target surface is changing and measurements are being 

25 obtained. For example, it is desirable that the selected spectral 

characteristic change rapidly during polishing of the target surface up to 
the moment planarization of the target surface is achieved. 



30 



First Embodiment 

In this embodiment, wavelengths corresponding to local maxima of 
the spectral reflectance and wavelengths corresponding to local minima 
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of the spectral reflectance are obtained by processing a spectral 
reflectance signal within the measurement wavelength range. Such local 
maxima and minima (along with their corresponding wavelengths) are 
used as measurement parameters of the spectral characteristic. For 
5 example, a polishing endpoint for a patterned wafer undergoing CMP can 
be detected with high accuracy by monitoring changes in the local 
maxima signals or changes in the local minima signals. 

Besides simply local maxima and local minima of spectral 
m reflectance, other useful parameters for measurement purposes are 

m 10 differences (e.g., local maximum - local minimum) and quotients (e.g., 
if ~! local maximum/local minimum). Also, in place of local maxima, the 

4* largest local maximum obtained in the measured wavelength range of the 

I spectral reflectance can be used as a measurement parameter. Similarly, 

in place of local minima, the smallest local minimum obtained in the 
N 1 5 measured wavelength range of the spectral reflectance can be used as a 
p measurement parameter. Yet other useful parameters are the difference 

r " (global maximum - global minimum) and the quotient (global 

maximum/global minimum). As used herein, a "global maximum" is the 
maximum, of all the maxima in the spectral signal, having the greatest 
20 magnitude; similarly, a "global minimum" is the minimum, of all the 
minima in the spectral signal, having the lowest magnitude. 

Whenever a signal change in the local maxima or global maximum, 
or (global maximum - global minimum), or (global maximum/global 
minimum) is used, the polishing endpoint can be detected with a high 
25 degree of accuracy by detecting the time during polishing at which these 
parameters rapidly increase. 

Whenever a signal change in the local minima or a global minimum 
is used, the polishing endpoint can be detected with a high degree of 
accuracy by detecting changes in any of these parameters with time. 
30 By illuminating a probe light having a spot diameter larger than a 

feature size onto a patterned wafer (or other workpiece) and measuring 
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the reflected signal light, the measured reflected signal light will contain 
a pattern-interference component. The pattern-interference component is 
generated by interference of a pattern-diffraction component and a 
component generated by the thin film on the wafer surface. During 
5 polishing, as the wafer surface is planarized, the amount of light 
scattered from the surface decreases and the diffraction efficiency 
increases. This diffracted light increases the local maxima of the zeroth- 
order light and decreases the local minima of the zeroth-order light. 
These parameters rapidly increase or decrease around the process 
10 endpoint. 

Second Embodiment 

In the second embodiment, the wavelength dispersion of a 
reflectance signal (as a representative spectral characteristic) within the 

1 5 wavelength range of probe light is used as a measurement parameter. 

The dispersion of the wavelength of the resulting signal light changes as 
polishing progresses. The dispersion of signal-light wavelength exhibits a 
rapid change immediately before and after the polish endpoint is reached. 
Therefore, by monitoring the wavelength dispersion of the signal light 

20 and detecting when the dispersion exhibits a rapid change, the polish 
endpoint can be detected with high accuracy. 

A suitable measure of wavelength dispersion is the standard 
deviation or variance of the signal-light wavelength. As the surface of 
the wafer is polished, the signal-light dispersion exhibits a change. The 

25 dispersion exhibits a rapid increase immediately before and after the 

surface of the wafer is planarized. As in the first embodiment, the effect 
of pattern interference appears prominently immediately before and after 
planarization is achieved during polishing. The reflectance also changes 
rapidly with respect to wavelength as polishing progresses, which is 

30 believed to be the cause of the increase in spectral reflectance. 
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Third Embodiment 

In the third embodiment, the waveform of the spectral reflectance 
(as a representative spectral characteristic) of signal light is subjected to 
Fourier-transformation. An appropriate wavenumber component of the 
Fourier-transformed waveform is used as a measurement parameter. The 
wavenumber component rapidly increases immediately before and after 
the wafer surface reaches a planar condition. 

This rapid increase is believed to be caused by the rapid increase 
in the interference component arising from the inter-pattern interference 
of the spectral reflectance signal immediately before and after the wafer 
surface is planarized. 

The wavenumber component of the wave number that most 
rapidly increases immediately before and after the wafer surface is 
planarized can be determined for use as a measurement parameter by 
routine experiments and/or simulations. 

Fourth Embodiment 

In the fourth embodiment, a cross-correlation function is used as a 
measurement parameter. The cross-correlation function of this 
20 embodiment is of the waveform of the spectral reflectance as measured 
in situ (i.e., while CMP or other process is ongoing) and a reference 
spectral reflectance previously obtained by measurement (and stored, for 
example, in a memory). The reference spectral reflectance can also be 
obtained by a simulation calculation based on various data such as the 
25 dimensions of the device pattern at the time polishing or analogous 
process is terminated. 

The waveform of the spectral reflectance changes as polishing 
progresses. The cross-correlation function generally exhibits a local 
maximum at the process endpoint. By detecting the instant at which the 
30 cross-correlation function exhibits a local maximum, the process endpoint 
can be detected with high accuracy. 
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The embodiments described above, based on parameters involving 
pattern-diffraction phenomena, are useful for detecting the process 
endpoint when planarizing a dielectric layer such as Si0 2 . The methods 
are also useful for monitoring removal of residual metal left over on the 
5 surface of the workpiece from a plug-forming step or similar step. 
During removal of such metal or of a metal layer, the wavelength- 
dependent profile of the parameter undergoes a rapid change as the 
underlying device pattern becomes revealed as polishing progresses from 
a starting condition in which the entire surface is a metal-layer surface. 
10 Thus, under such polishing conditions, the process endpoint is detectable 
with even greater accuracy. 

Example 1 

In this example, a 6-inch wafer was destined for use as an image- 

1 5 pickup device. An Si0 2 layer, applied to the wafer for use as an 

interlayer insulator, was subjected to polishing by CMP. The polishing 
endpoint was detected. FIG. 2 shows certain aspects of the 
measurement apparatus used in this example, and FIG. 3 shows certain 
details of the measuring optical system used in the FIG. -2 apparatus. 

20 In FIG. 2, a polishing pad 1 3 is affixed to a polishing plate 14. A 

wafer 1 2 is mounted to a polishing head 11. By rotating the polishing 
head 1 1 and the polishing plate 14, a relative motion is created between 
the wafer 1 2 and the polishing pad 1 3 to polish the wafer 1 2. As shown 
in FIG. 1, an opening (desirably about 2 cm in diameter) is defined in the 

25 polishing plate 14 to transmit a beam of probe light through the polishing 
plate 14 to the wafer surface. The polishing pad 13 is made of an epoxy 
resin and contacts the bottom surface of the wafer 1 2. A transparent 
window 1 5 (made of quartz) is provided in the opening in the polishing 
plate 14. At the polishing surface of the polishing pad 13, the window 

30 1 5 is coplanar with the polishing surface 1 3. 
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Turning now to FIG. 3, a halogen lamp 19 projects a probe light P 
through lenses 21a-21c and through a beamsplitter 22 vertically onto the 
surface 23 of the wafer 1 2. The signal light S reflected by the surface 

23 passes through the lens 21c and is reflected by the beamsplitter 22. 

5 The signal light S then passes through a lens 21d, is reflected by a mirror 
26, passes through a lens 21e, passes through a pinhole 25a defined by 
a plate 25 (that blocks scattered and diffracted light), and passes 
through a lens 21 f to a diffraction grating G. The diffraction grating G 
resolves the wavelengths of the signal light S to allow different 
10 wavelengths to propagate in different directions toward a linear sensor 

24 (e.g., optical diode or "512" device) that detects the various 
separated wavelengths of signal light. The wavelength-measurement 
range of the sensor 24 is approximately 400 nm to 700 nm in this 
example. The probe light P incident on the wafer 12 has a diameter of 

1 5 about 1 mm. The spectral-reflectance signal output from the linear 

sensor 24 is amplified and routed to a signal processor 17 that calculates 
therefrom a useful measurement parameter. The signal processor 17 
also determines the polishing endpoint from the temporal change in the 
measurement parameter as a function of the thickness of the layer being 

20 polished. The signal processor 17 also outputs a control-related signal 

such as the spectral reflectance as a function of time or film thickness to 
a display 1 8 if desired. 

During polishing, a dispenser 104 discharged a CMP polishing 
slurry 105 onto the surface of the wafer 1 2. The slurry was an alkaline 

25 aqueous suspension of silica particles. During polishing, the pressure 
applied by the polishing pad 13 to the wafer surface was about 100 
grams per square cm. Loss of probe light P due to the abrasive slurry 
(mainly due to scattering) was below 1 percent. 

The surface of the wafer 1 2 subjected to polishing was a 1000- 

30 nm thick Si0 2 insulating layer applied by chemical vapor deposition over 
a patterned metal layer 500 nm thick. Thus, the wafer 1 2 had steps 
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about 500 nm high on its surface. To polish such a surface, the 
polishing pad 13 was hard. Polishing resulted in a satisfactory degree of 
planarization, in which the steps were reduced to less than 10 nm. The 
signal waveform of the spectral reflectance changed from a pre-polish 
5 state as shown in FIG. 4(a) to a post-polish state as shown in FIG. 4(b). 
From the spectral reflectance data, the signal processor 1 8 
calculated an appropriate measurement parameter. By monitoring the 
measurement parameter as polishing progressed, the polishing endpoint 
was determined easily with high accuracy. 

10 

Example 2 

This example differs from Example 1 in that, in Example 2, the 
quotient (global minimum spectral reflectance)/(global maximum spectral 
reflectance) was used as the measurement parameter. The signal 

15 processor 17 (FIG. 2) calculated local minima and local maxima by 

differentiating the spectral reflectance signal. From such local minima 
and maxima, the signal processor 1 7 obtained the global maximum, the 
global minimum, and the quotient (global maximum/global minimum). To 
obtain the global maximum and global minimum, the signal processor 

20 compares each value of the local maxima and the local minima, 

respectively. The signal processor then determines which local maximum 
has the greatest signal magnitude and identifies that particular local 
maximum as the global maximum, and determines which local minimum 
has the lowest signal magnitude and identifies that particular local 

25 minimum as the global minimum. The measurement parameter (global 

maximum/global minimum) was monitored with respect to the thickness 
of the layer being polished as polishing progressed. FIG. 5 depicts the 
change in the quotient with respect to layer thickness as polishing 
progressed. As shown in FIG. 5, the quotient exhibited a rapid decrease 

30 around the polishing endpoint (at which time the surficial steps had been 
eliminated by polishing). Thus, by monitoring the quotient in the vicinity 
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of the polishing endpoint, the moment to cease polishing was determined 
easily with high accuracy. 

FIG. 6 shows the change in intensity of total reflection with 
changes in the thickness of the layer being polished. The polishing 
5 endpoint is indicated with an arrow. Since total reflection changes 

slowly in the vicinity of the polishing endpoint, this parameter is not as 
suitable for accurately determining polishing endpoint as the quotient 
parameter shown in FIG. 5. 

10 Example 3 

This example differs from Examplel in that, in Example 3, the 
wavelength dispersion of the spectral reflectance was determined for 
each measured value of the spectral reflectance, and the dispersion 
values were used as the measurement parameter. (The wavelength 

1 5 dispersion of the spectral reflectance is the variance of reflectance as 
measured over a range of wavelengths, i.e., over a spectral range. The 
"variance" is the square of the standard deviation, as known in the art.) 

FIG. 7 is a plot of the change in dispersion of the spectral 
reflectance signal with changes in the thickness of the layer being 

20 polished. The polishing endpoint is indicated with an arrow. As shown 
in FIG. 7, the parameter rapidly increases in the vicinity of the polishing 
endpoint. Thus, the polishing endpoint is detected with high accuracy. 

Example 4 

25 This example differs from Example 1 in that, in Example 4, the 

spectral reflectance signal is Fourier-transformed and an appropriate 
wavenumber component of the Fourier-transformed spectral reflectance 
signal is used as the measurement parameter. 
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The Fourier transform was calculated using the following equation 



In the foregoing equation, \, k, f(;J, and F(k) are the wavelength, the 
wavenumber, the spectral reflectance (wavelength-dependent reflectance 
spectrum), and the Fourier transform of f(^), respectively. In this 
example, the Fourier transform of f (jj was calculated from the spectral 
reflectance signal obtained at discrete wavelengths over a wavelength 
range. 

The manner in which the optimal wavenumber is determined 
depends on the profile and dimensions of the pattern defined by the 
surficial layer on the wafer. Hence, the optimal wavenumber component 
can be calculated from the profile and dimensions of the pattern. 
However, multiple wavenumber components can be selected, and 
changes in each wavenumber component can be simulated with respect 
to the change in the thickness of the layer being polished. In this 
instance, the wavenumber component having a plot that changes most 
rapidly in the vicinity of the polishing endpoint is desirably selected as 
the optimal wavenumber component. 

As shown in FIG. 8, the measurement parameter rapidly changed 
in the vicinity of the polishing endpoint. Thus, the polishing endpoint 
was easily detectable with high accuracy. 

Example 5 

This example differs from Example 1 in that, in Example 5, a 
cross-correlation function was calculated and used as the measurement 
parameter. The cross-correlation was of the spectral reflectance signal 
that changes with corresponding changes in layer thickness and a 




(1) 
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reference spectral reflectance signal (previously obtained and stored) 
corresponding to the polishing endpoint. 

A plot (not shown) of the cross correlation function revealed that 
the spectral reflectance signal that changes with corresponding changes 
5 in layer thickness coincides approximately with the stored reference 

spectral reflectance signal obtained at the polishing endpoint. The cross- 
correlation function also rapidly increased in the vicinity of the polishing 
endpoint. By detecting an appropriate point in the vicinity of the 
polishing endpoint, it is thus possible to easily determine, with high 
10 accuracy, when to terminate polishing. 

The cross-correlation function was defined by the following 
equation: 



15 wherein f(^), g(A,)/ anc * RxyW represent the spectral reflectance signal that 
changes as polishing progresses, the reference spectral reflectance 
signal, and the defining equation of the cross-correlation function, 
respectively. The cross-correlation coefficient is defined as the value of 
the cross-correlation function in which f(^) differs from g(x) by the 

20 wavelength T . 

The reference spectral reflectance can be obtained by simulation 
but also can be obtained in advance by measurement. 

Although methods as exemplified in each of the foregoing 
examples can be used independently, the methods can also be used in 

25 combination to achieve satisfactory results. 



CO 




(2) 



Fifth Embodiment 

Certain features of a polishing apparatus according to this 
embodiment are shown in FIG. 2. The FIG. -2 apparatus is operable to 
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detect the thickness of a surficial layer or a process endpoint. As 
summarized above, the FIG. -2 embodiment comprises a polishing head 
1 1 for holding a wafer 1 2 as a workpiece and for urging the wafer 1 2 
against a polishing pad 13. The polishing pad 13 is mounted to a 
5 polishing plate 14. Extending through the polishing plate 14 and the 
polishing pad 1 3 is a translucent window 15. Probe light P reflected 
from the surface of the wafer 1 2 passes through the window 1 5 and is 
received by a light sensor 16. The light sensor 16 is connected to a 
signal processor 17, which is connected to a display 18. The FIG. -2 

10 apparatus also comprises a wafer-rotation mechanism (only the rotation 
direction 30 is shown) that creates a relative motion between the 
polishing pad 13 and the wafer 12, and a polishing-plate-rotation 
mechanism (only the rotation direction 40 is shown). As the wafer 12 is 
being polished, the wafer 1 2 is urged (arrow 20) against the surface of 

15 the polishing pad 13. Meanwhile, a relative motion between the 

polishing pad 1 3 and the wafer 1 2 is generated by the rotational motion 
40 of the polishing pad 1 3 and the rotational motion 30 of the wafer 1 2. 
An abrasive slurry 105 is supplied from a dispenser 104. Probe light P 
and signal S pass through the window 1 5 in order to detect the 

20 thickness of the surficial layer on the wafer 1 2 or the process endpoint 
as polishing is ongoing. Based on the signal processed by the signal 
processor 17, the thickness of the surficial layer on the wafer 12 is 
monitored. 

In order to perform optical monitoring of the wafer 12, the probe 
25 light is desirably a continuous light having multiple wavelength 

components, e.g., a white light or spectral components of white light. A 
signal, corresponding to a spectral characteristic of the signal light 
reflected from the wafer 1 2, is processed by the signal processor 17. 
The probe light P may be illuminated either directly onto the surface of 
30 the wafer 1 2 (as shown), or onto the underside of the wafer 1 2 (in the 
latter instance, probe light transmitted through the wafer 1 2 is detected 
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as signal light, and the probe light is preferably a multiple-component 
infrared light). 

As discussed above, probe light reflected as signal light by the 
pattern on the surface of the wafer 1 2 can be regarded as a 
superposition of light waves reflected from each of multiple layers of 
laminated thin films on the wafer surface. Each layer defines a 
respective pattern. The waveform of a spectral characteristic of the 
signal light is a result of a complicated interference effect. Hence, the 
waveform of the spectral characteristic of signal light reflected by a 
patterned layer differs substantially from the waveform of the spectral 
characteristic of signal light reflected from a planar non-patterned layer 
(blank layer), even if the thickness of the patterned layer is the same as 
the thickness of the blank layer. 

It is generally complicated to calculate the thickness of a selected 
patterned layer on the wafer 1 2 directly from the waveform of the 
spectral characteristic of the light reflected by the patterned layer. 
However, as discussed above, the spectral reflectance of signal light 
from an otherwise similar control patterned layer having a prescribed 
thickness can be pre-determined and used as a standard against which a 
measured spectral reflectance for a selected layer is compared. Hence, 
the thickness of the selected layer is measured with high accuracy. 
Polishing is terminated when the measured reflectance from the selected 
patterned layer matches the pre-calculated reflectance of the control. 

A spectral reflectance for a particular sample workpiece can be 
measured using a dummy wafer. The spectral reflectance obtained with 
the dummy wafer can be used as a reference control for use when 
polishing other similar wafers. 

Whenever a pattern on a wafer is a one- or two-dimensional 
periodic pattern distributed uniformly throughout the wafer surface (e.g., 
a DRAM pattern) and the area illuminated by the probe light is wider than 
the pitch of the pattern, a stable and position-independent signal can be 
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obtained from average spectral reflectance values. Also, in such an 
instance, the layer thickness can be measured easily in situ (during 
polishing) because the same spectral reflectance waveform can be 
obtained by reflection of probe light from any location on the wafer (the 
5 pattern is the same over substantially the entire wafer surface). 

But, if the pattern on the wafer is for, e.g., a microprocessor or 
ASIC chip in which the pattern is not uniformly distributed, the spectral 
waveform can differ substantially depending upon the specific location 
actually illuminated by the probe light. Despite such variability in the 
10 signal, it has been discovered that, with respect to microprocessor or 
ASIC patterns on a wafer surface, a reasonably stable and reproducible 
signal nevertheless can be obtained from the wafer surface. This is 
because the average pitch of patterns for such devices is typically about 
the same throughout, regardless of the non-uniformity of the pattern. 
1 5 We have analyzed the waveforms of signals from such patterns (spectral 
reflectance or spectral waveforms) and noticed that the profiles of such 
waveforms are greatly dependent upon the degree of fineness of the 
pattern. In other words, device patterns having approximately the same 
degree of fineness exhibit approximately the same spectral waveform. 
20 In performing a measurement using probe light that impinges 

orthogonally onto the wafer surface, the layer thickness and the feature 
density of the pattern on the wafer surface (in addition to the degree of 
fineness) influence the waveform obtained from the probe light. Here, 
the feature density is defined as the proportion of the total pattern 
25 occupied by pattern features (which typically appear as raised areas on 
the surface of a wafer that has not yet been planarized). The raised 
areas corresponding to pattern features are usually formed of a layer of a 
dielectric superposed on a metal layer. Normally, during formation of 
such layers, the film thickness is substantially uniform over the wafer 
30 surface, and the feature density is in the range of 30 to 50 %. It has 
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been discovered that the influence of these factors on the position 
dependency of the spectral waveform is not significant. 

For example, spectral reflectance data were obtained with 
representative device patterns on a wafer. Two patterns were evaluated, 
5 both having the same layer thickness and feature density. One pattern 
had an extremely fine pattern pitch such as that of memory cells (FIG. 
9(a)), and the other had a coarse pattern pitch such as that of a wiring 
layer (FIG. 9(b)). Comparing FIG. 9(a) with FIG. 9(b), a distinctive 
difference was observed in the waveform obtained with each of these 
10 patterns. Based on this observation, it was discovered that the degree of 
fineness of the pattern has a greater influence on the overall waveform 
than the feature density. 

Therefore, the reflectance of probe light from a pattern on a wafer 
is dependent upon the degree of fineness of the device pattern and 
1 5 results from differences in the interference phenomena exhibited by 

probe light reflected from such patterns. In this regard, it will be recalled 
in the discussion above regarding FIG. 1 that light reflected from a 
patterned laminate of thin films is a superposed interference 
phenomenon. The interference is due to amplitude splitting due to the 
20 thickness of the layers and wave-surface splitting due to interaction of 
probe light with the features of the pattern. These pattern-dependent 
interference phenomena are generated between features within the 
spatial coherence length of the illuminating optical system. (The "spatial 
coherence length" is a distance, on the device pattern, in which light 
25 irradiated onto the pattern has coherency. Therefore, these pattern- 
dependent interference phenomena do not occur when the feature width 
is larger than the coherence length. 

With a coarse pattern having a large feature width, the spectral 
waveform appears to be determined by the sum of all the light intensities 
30 from the features of the pattern. This was confirmed by comparing 

actual measurement data with results obtained in a simulation calculation 
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using a mathematical model based on interference caused by amplitude 
splitting, interference caused by wave-surface splitting, and the 
coherence length. 

5 Sixth Embodiment 

According to this embodiment, a determination is made of whether 
probe light is properly illuminated on a device pattern on a wafer 
workpiece based on the spectral waveform of reflected probe light from 
the pattern. If the probe light is actually incident on a device pattern, 

1 0 then a determination is made of which portion of the pattern is being 

illuminated by the probe light (e.g., region A, B, C, D, E, or F in FIG. 10). 
The illumination position of the probe light can then be specified. 

Based on the illumination position of the probe light, the model 
calculation by which the layer thickness is calculated can be adjusted or 

1 5 changed as needed. Moreover, the target waveform at the process 
endpoint is also selected based on the location on the workpiece 
illuminated by the probe light. Thus, the polishing process can be 
controlled by detecting layer thickness and process endpoint regardless 
of the location on the workpiece illuminated by the probe light. 

20 Detection is facilitated by specifying the location on the workpiece 

(i.e., the specific portion of the pattern on the workpiece) to be 
illuminated by the probe light. Such a specification is based on the 
obtained spectral waveform. Multiple calculations are performed using 
the respective target waveforms corresponding to the various illumination 

25 locations on the device pattern. When these methods are used with an 
apparatus having a mechanism for changing the illumination location of 
the probe light on the workpiece, a desired location can be illuminated 
with the light by conducting a time-series analysis of the respective 
spectral waveform. 

30 In the case of an in-situ measurement conducted during polishing, 

data from various locations on the workpiece are continuously acquired 
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since the illumination location cannot be controlled under such 
conditions. However, by determining and specifying the illumination 
location based on the spectral waveform and processing the data 
corresponding to the illumination location, the process can be controlled. 
5 For example, only data from the same location on the workpiece can be 
selected each time for processing (e.g., any of the regions A, B, C, D, E, 
and F shown in FIG. 10). Alternatively, data acquired from various 
locations (e.g., any of the regions A, B, C, D, E, and F shown in FIG. 10) 
can be segregated out for individual processing. 
10 Specific parameters useful for determining and specifying the 

illumination location based on the spectral waveform are listed below: 

(1) difference between the largest local maximum and the 
smallest local minimum; 

(2) ratio of the smallest local minimum to the largest local 
15 maximum; and 

(3) smallest local minimum. 

One or more parameters can be selected from the foregoing list. 
The value of the selected parameter obtained from the spectral waveform 
by signal processing is compared with a previously stored value obtained 

20 by measurement or simulation calculation at a different pattern location. 
Thus, the illumination position is determined and specified. 

This embodiment can be applied not only to dielectric layers (e.g., 
inter-layer insulating films) but also to metal layers. For example, in 
forming an electrode-layer inlay in an integrated circuit, a metal layer is 

25 laminated over the entire surface of the wafer. Most of the metal is 

subsequently removed by etching or polishing, which leaves only certain 
portions of the surface covered with the metal. The spectral waveform 
of signal light reflected by a metal film is usually smooth. When a 
portion of the metal layer is removed to reveal the pattern, the spectral 

30 waveform of signal light reflected from the surface becomes influenced 
by the underlying dielectric layer. As a result, the spectral waveform of 
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the reflected signal light changes significantly. By observing this change, 
the measurement location on the metal layer of each pattern can be 
ascertained effectively. 

Whenever the illumination location can be determined and 
5 specified from the spectral waveform, one or more appropriate 
parameters is then obtained from the spectral waveform by signal 
processing. Each selected parameter is then used to monitor the process 
endpoint. It is desirable that the parameter(s) be selected from the 
following list: 

10 (1) one or more parameters selected from the set {local 

maximum, local minimum, (local maximum - local minimum), (local 
minimum/local maximum)}; 

(2) one or more parameters selected from the set {largest local 
maximum, smallest local minimum, (largest local maximum - smallest 

15 local minimum), (smallest local minimum/largest local maximum)}; 

(3) a dispersion (variance); 

(4) a component of an Fourier transform (e.g., of spectral 
reflectance); and 

(5) a cross-correlation function of the spectral waveform with a 
20 previously obtained spectral waveform obtained by, e.g., a simulation 

calculation. 

The parameter used to detect the process endpoint may be the 
same as the parameter used to determine a measurement location but 
may be different from the parameter used to determine the measurement 
25 position. 

Although this embodiment was described in the context of a 
polishing process using an apparatus as shown, e.g., in FIG. 2, this 
embodiment can be used with equal facility with any of various other 
apparatus that impart a physical change to the workpiece surface, such 
30 as an ion-etching apparatus or the like. 
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Example 6 

In this example, a Si0 2 inter-layer insulating film for an image- 
pickup device was layered on a 6-inch diameter wafer. An image-pick-up 
device has a layout as shown in FIG. 10. The regions A, B, and C have a 
high degree of fineness and contain active devices, capacitors, and the 
like. The regions D, E, and F have a low degree of fineness and contain 
wires and the like. 

The wafer was polished using a polishing apparatus as shown in 

FIG. 2 and described above. 

The measurement system was configured as shown in FIG. 3 and 
described above. The wavelength range of probe light in this example 
was 400 nm to 800 nm. The spot diameter of probe light on the wafer 
1 2 was about 2 mm. Reference data obtained from the spectral 
waveform measured from the pattern of the present image-pickup device 
was pre-stored in the signal processor 17. This reference information 
was used as a reference value for signal processing. 

The polishing slurry used in this example was an alkaline 
suspension of silica particles. The polishing pressure applied to the 
wafer 12 during polishing was about 100 g/cm 2 . The amount of probe 
light lost (mainly due to scattering) due to the presence of polishing 

slurry was below 1 %. 

Using the above-described apparatus, a preliminary measurement 
was performed on a sample wafer having the same pattern (image-pickup 
device) and shape as the product wafer. An Si0 2 insulating film was 
formed on the top surface of the sample wafer to about 1000 nm thick 
by CVD. The sample wafer was then polished. Polishing was 
terminated when the Si0 2 insulating film thickness was reduced to about 
500 nm, and spectral waveform signals were obtained. The obtained 
spectral waveform signals were then analyzed. 

The plots shown in FIG. 11 (a) are of the spectral waveform signals 
obtained from the regions A, B, and C, and indicate the degree of 
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fineness in these regions. FIG. 1 1 (b) shows the spectral waveform 
signals obtained from the regions D, E, and F, and indicate the lesser 
degree of fineness in these portions. Based on the similarities of the 
waveforms shown in FIG. 1 1 (a), the regions A, B, and C are classified in 
5 a "block 1 " category. Similarly, the regions D, E, and F are classified in 
another category referred to as "block 2." 

The sample wafer was polished to several prescribed thickness 
J5 values ranging from zero to the maximum prescribed thickness. The 

spectral waveform was measured after polishing reduced the insulating 
H 10 layer to each of the prescribed thickness values. The difference between 

1= the largest local maximum and the smallest local minimum was obtained 

^ for each of the blocks 1 and 2 at each of the various thickness values. 

C: Between blocks 1 and 2, a significant difference was observed between 

Li the largest local maximum and the smallest local minimum at the various 

15 thickness values. However, within the same block, no significant 
M difference was observed between the largest local maximum and the 

smallest local minimum at the various thickness values. This result 
allowed appropriate constants to be set between the values for block 1 
and the values for block 2. These values were stored as reference 
20 information. 

In other words, all possible values of the maxima and all possible 
values of the minima of each spectral reflectance were pre-stored for 
each block. Identification of a block from which the signal light 
propagates can be identified by comparing the measured maximum 
25 values with the pre-stored values of maxima for each block. 

In addition to these values, data useful for determining film 
thickness and for establishing the polishing endpoint were also 
determined from the differences between the largest local maximum and 
the smallest local minimum, and stored for future reference. 
30 An image-pickup device having the same shape as the sample 

wafer used to obtain the reference data was selected for use as a 
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product wafer. An Si0 2 insulating film was formed on the top surface of 
the product wafer to about 1000 nm thick by CVD. Polishing of the 
product wafer was then initiated. The difference between the largest 
local maximum and smallest local minimum of the spectral waveform 
5 was measured during polishing of the product wafer. The difference 
values obtained with the product wafer were compared to the values 
stored as reference data. Whenever a difference value was larger than 
the corresponding constant obtained with the reference wafer, the 
respective spectral waveform was produced by block 1 . Otherwise, the 

10 respective spectral waveform was produced by block 2. 

The difference between the largest local maximum and the 
smallest local minimum was used as the measurement parameter for 
detecting the polishing endpoint. Only regions corresponding to block 1 
were monitored, leaving block 2 unmonitored. During polishing, the 

15 difference between the largest local maximum and the smallest local 
minimum changed. This difference was compared with the film 
thickness or polishing endpoint data of the reference wafer to determine 
the polishing endpoint for the sample wafer. When the process endpoint 
was detected, polishing was terminated. 

20 Certain product wafers that had completed the polishing step were 

examined. Planarization of each product wafer was confirmed. It was 
also confirmed that each of the examined product wafers had been 
polished to within about 3 percent of the target thickness of 500 nm. 

25 Example 7 

This example differs from Example 6 in that a cross-correlation 
was calculated of the measured spectral waveform values with the 
recalled reference spectral waveform values associated with the polishing 
endpoint. The cross-correlations were used as the measurement 
30 parameter in place of the differences between the largest local maximum 
and the smallest local minimum. Otherwise, the measurements were 
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conducted using the same apparatus and methods as described in 
Example 6. 

In view of the differences from Example 6, the spectral waveforms 
of the polishing endpoint obtained from the regions of block 1 as film- 
5 thickness information or polishing-endpoint information were stored as 
reference information. 

The cross-correlations were determined as polishing progressed. 
The results were compared with the constant values stored as reference 
data. Whenever the results of the cross-correlations were larger than the 
1 0 corresponding constant values, it was determined that the respective 
spectral waveform came from the regions of block 1 . Otherwise, the 
spectral waveforms were deemed to have come from the regions of 
block 2. 

As in Example 6, only the regions of block 1 were monitored. 

1 5 During the polishing process, only the cross-correlation of the measured 
spectral waveforms and the reference spectral waveforms associated 
with the polishing endpoint (stored as reference data) was calculated. 
Polishing was terminated at the point on the plot at which the cross- 
correlation rapidly increased. 

20 Some of the product wafers that had completed polishing were 

examined. From such examinations, it was confirmed that the surface of 
each of the observed wafers had been planarized. It was also confirmed 
that each of the observed wafers had been polished to within about 3 
percent of the target thickness of 500 nm. 

25 

Example 8 

This example differs from Example 6 in that a local minimum was 
used as the measurement parameter for determining measurement 
position. Whenever the value of the measurement parameter was below 
30 a prescribed value, it was determined that the spectral waveform came 
from a block-1 region. Otherwise, the spectral waveform was deemed to 
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have originated from a biock-2 region. Except for these differences, 
measurements were conducted using the same apparatus and the same 
methods as in Example 6. 

As polishing progressed, the local minimum changed. The local 
5 minimum was compared to the film-thickness or polishing-endpoint data 
(previously stored) to determine the actual polishing endpoint. Upon 
reaching the polishing endpoint, polishing was terminated. 

Some of the product wafers that had finished being polished were 
examined. From the examinations, it was confirmed that the surface of 
10 each of the observed wafers had been planarized. Moreover, it was 

confirmed that each of the observed wafers had been polished to within 
about 3 percent of the target thickness of 500 nm. 

Example 9 

1 5 This example utilized the same measurement methods, apparatus, 

and position-detection methods as in Example 6 (i.e., based on a 
difference between the largest local maximum and the smallest local 
minimum). On a wafer, a metal (aluminum) layer was polished using 
CMP while the formation of plugs was monitored. At the start of 

20 polishing, the entire surface to be polished was covered with the metal 
layer. Probe light reflected by the metal layer at this stage exhibited a 
substantially flat spectral waveform. As polishing progressed and the 
underlying insulating layer began to be exposed, local maxima and local 
minima started to appear in the spectral waveform due to interference. 

25 The difference between the largest local maximum and the smallest local 
minimum was compared with a constant value previously measured 
using a test wafer and stored as reference data. If the value of the 
measurement parameter dropped below the prescribed constant value, 
then the spectral waveform was deemed to have been produced by a 

30 block-1 region. Otherwise, the spectral waveform was deemed to have 
come from a block- 2 region. 
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Seventh Embodiment 

When a probe light is illuminated onto a wafer including a device 
pattern that is two-dimensionally non-uniform, the waveform of the 
5 resulting signal light reflected from the wafer differs significantly from a 
waveform of signal light that would be otherwise be obtained by 
reflection of probe light from a non-patterned blank film. The signal light 
is also affected by interference between pattern features. 

In addition to ordinary reflected signal light from a wafer, many 
10 diffraction spots are produced, and the amount of light from such spots 
cannot be ignored. As shown in FIG. 1 2, the nth diffraction spot is a 
function of the pitch "d" of the pattern on the wafer and the wavelength 

X, of the probe light in the direction of the reflection direction 0. The nth 
diffraction spot is thus determined by the following equation: 

15 

dsine = nx (3) 

The amount of signal light at the nth diffraction spot changes in 
accordance with the thickness of the layer on the wafer in a manner that 
20 is different from the amount of change exhibited at a diffraction spot of 
any other order. Therefore, in order to determine the film thickness from 
the signal light, the diffraction order of the spot being measured must be 
known. 

According to conventional practice, the thickness of a patterned 
25 layer is determined by measuring diffraction spots only of prescribed 

orders (i.e., higher than first order). However, the conventional method 
cannot accommodate various patterns. As indicated by Equation (3), 
above, the direction q of the diffracted signal light (i.e., the position of 
the spot) depends on the period "d" of the features in the pattern. 
30 Providing a mechanism that can be used with any of various directions of 
diffracted light is extremely complicated. 
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In this embodiment, only the zeroth order of diffracted light (i.e., 
regularly reflected light) is detected. Thus, there is no need to take the 
influence of the pattern pitch into consideration when calculating the film 
thickness or detecting the process endpoint. This also substantially 
5 simplifies the calculations. Moreover, in measuring a device pattern, the 
influence of signal light scattered from bumps on the surface of the 
wafer can be a significant component of signal light besides the regularly 
reflected light. This component manifests itself as noise. By deliberately 
extracting and measuring only the zeroth-order light, a large portion of 

10 the noise caused by scattered signal light is eliminated. The direction of 
propagation of the regularly reflected signal light (zeroth order light) is 
constant regardless of the pattern. This makes it comparatively easy to 
configure a detection system for such light. 

In order to measure only the zeroth-order signal light, it is 

1 5 necessary to efficiently eliminate diffracted lights of order higher than 
zeroth order. The present embodiment achieves such a goal. 
Specifically, in this embodiment, probe light is illuminated vertically onto 
a measurement sample. The thickness of a layer on the measurement 
sample is detected from the zeroth-order signal light reflected from the 

20 measurement sample. 

An apparatus according to this embodiment is shown in FIG. 1 3. 
The apparatus comprises a source 35 of probe light P, lenses 36a-36f, a 
beamsplitter 37, a mirror 38, a light-blocking plate 31 (defining an 
aperture 31a), and a detector 39. Item 34 is a workpiece (e.g., wafer). 

25 The plate 31 defines a slit-shaped orifice 31a. The plate 31 blocks 

diffracted signal light S but allows regularly reflected (zeroth order) signal 
light to pass through the orifice to the detector 39. 

As described above, the directions of propagation of higher orders 
of diffracted light (orders greater than 0) are a function of the period of 

30 the features of the pattern on the wafer 34. Therefore, when unwanted 
diffracted light is blocked as described above, the allowable size range of 
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the slit orifice 31a differs depending on the pattern. Therefore, it is 
desirable that the width of the slit 31a be variable according to the 
pattern to be measured. 

Another effective method is to measure only the zeroth order light 
5 by detecting only signals emitted from pixels of an image-pickup device 
that receive zeroth order light. I.e., signal light is received by a two- 
dimensional image-pickup device (CCD or the like) sufficiently large to 
receive not only zeroth order signal light but also higher orders of 
diffracted light. However, the zeroth order light is selectively detected 
10 by the image-pickup device. 

At least one measurement parameter of the zeroth order light is 
measured. The measurement parameter is selected from the following 
that can be plotted against wavelength of signal light: 

(a) local maximum of a spectral characteristic of signal light, 
1 5 (b) local minimum of a spectral characteristic of signal light, 

(c) local maximum of a spectral characteristic - local minimum of 
the spectral characteristic, 

(d) local minimum of a spectral characteristic/local maximum of 
the spectral characteristic, 

20 (e) largest local maximum of a spectral characteristic of signal 

light, 

(f) smallest local minimum of a spectral characteristic of signal 

light, 

(g) largest local maximum of a spectral characteristic - smallest 
25 local minimum of the spectral characteristic, 

(h) smallest local minimum of a spectral characteristic/largest local 
maximum of the spectral characteristic, 

(i) dispersion of a spectral characteristic of signal light, and 

(j) an appropriate component of a Fourier transform of a spectral 
30 characteristic of signal light (e.g., a component whose magnitude is 
maximum). 
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A representative spectral characteristic is spectral reflectance. By 
monitoring the selected parameter(s), and by comparing the spectral 
characteristic signal with a previously stored spectral characteristic signal 
5 (obtained, e.g., by a simulation calculation), the thickness of the 

remaining layer being polished or the process endpoint can be detected 
with high accuracy. The zeroth order light of the signal light can 
alternatively be directly monitored without dispersing the light and 
without monitoring one or more of the above-listed parameters. 

10 The methods of this embodiment are not only effective when the 

thickness of the remaining layer being polished is measured using a probe 
light comprising multiple wavelength components, but also when the 
probe light contains a single wavelength component. 

Methods according to this embodiment can be performed using an 

15 apparatus as shown in FIG. 2 and discussed above. 

Example 10 

In this example, a Si0 2 inter-layer insulating film on an image 
pickup device being fabricated on a 6-inch diameter wafer was polished 

20 by CMP using the polishing apparatus of FIG. 2. To allow monitoring of 
the film thickness during polishing, the window 1 5 (made of quartz) had 
a diameter of 2 cm and extended through the polishing plate 14 and 
polishing pad 13. The polishing pad 13 was made of an epoxy resin and 
contacted the bottom surface of the wafer 1 2. The probe light P was 

25 illuminated through the window 1 5 and reflected signal light S returned 
through the window 15. 

The measurement optical system was configured as shown in FIG. 
3. The source 1 9 of probe light was a xenon lamp that produced white 
light. The probe light P was incident perpendicularly on the surface of 

30 the wafer 1 2. Signal light S passed through a pinhole 25a defined by the 
plate 25. The plate 25 blocked scattered and diffracted light. Signal 
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light passing through the pinhole 25a propagated to a diffraction grating 
G, The diffraction grating G resolved the wavelength of the signal light S 
so that different wavelengths propagated in different directions toward 
the linear sensor 24. A slit 20a defined by a plate 20 determined the 
5 area on the wafer surface illuminated by the probe light P. The 
wavelength measurement range of the linear sensor 24 was 
approximately 400 nm to 800 nm, The diameter of the illumination spot 
of probe light P on the wafer 12 was about 2 mm. The output from the 
linear sensor 24 was amplified and processed by the processor 17 (e.g., 
10 PC). A previously measured spectral intensity profile of the probe light P 
was used in determinations of a spectral characteristic of the signal light 
S. 

The polishing slurry 105 used in this example was an alkaline 
suspension of silica particles. The polishing pressure was about 100 

15 g/cm 2 . Light scattering due to the polishing slurry was less than 1 %. 

The aperture 25a defined by the plate 25 had a diameter of 500 
pm, which was effective for eliminating most of the scattered light and 
diffracted light of first-order or above. 

Results obtained from measurements of the spectral characteristic 

20 of the wafer being polished are shown in FIGS. 14(a) and 14(b). The 

spectral characteristic in this instance was spectral reflectance. In each 
of FIGS. 14(a) and 14(b), the curve for "measured" data (solid line) fits 
very well with the curve for "calculated" data (broken line) for 
reflectance of zeroth order diffracted light as a function of light 

25 wavelength. FIG. 14(a) shows data at start of polishing, and FIG. 14(b) 
shows data at the polishing endpoint. These results indicate that 
measuring only the zeroth order light provided accurate agreement with 
model data. 

The "calculated" spectral reflectance data were obtained using a 
30 wafer having a prescribed polished film thickness. The "calculated" data 
were compared to spectral reflectance data obtained during polishing of a 
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test wafer. At the moment the waveform of the spectral reflectance 
obtained by measurement agreed, within a certain range, with the 
waveform of the spectral reflectance obtained by calculation, it was 
determined that polishing of the subject thin film was completed. In 
5 examinations of wafers that had "completed" polishing, it was confirmed 
that the respective surfaces of the wavers had been planarized to within 
7 percent of the target film thickness. 

Example 1 1 

1 0 This example utilized the same optical system as used in 

Example 10, except that a helium-neon laser was used as the source of 
probe light (wavelength 633 nm). In this example, a spectral curve was 
not used. The polishing endpoint was determined by monitoring the 
intensity profile of probe light reflected from the wafer surface. This 

1 5 method was effective whenever the pattern on the wafer was simple. 
Also, by utilizing a slit in the same manner as in Example 10, it was 
possible to block all first order and higher orders of diffracted light. 
Since the diffraction spots could been seen with the unaided eye, it was 
visually confirmed that the diffraction spots were blocked by the slit. By 

20 tracking only the intensity of the zeroth order light by this method, the 
polishing endpoint was accurately detected. 

Eighth Embodiment 

A measurement optical system according to this embodiment is 
25 shown in FIG. 15. The system comprises a source 41 of probe light P 
(desirably a source of white light), a spatial coherence controller 42, 
lenses 43a, 43b, a beamsplitter 44, and a collimating lens 45 that directs 
probe light P to a surface 47 of a wafer 46. The spatial coherence 
controller 42 operates to control the spatial coherence length; to such 
30 end, the spatial coherence controller 42 can be, for example, an aperture. 
The apparatus also comprises lenses 48a, 48b, 48c for refracting 
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reflected signal light S, a mirror 49, a diffraction grating 50, and a light 
sensor 51. The source 41 can be, for example, a tungsten lamp, a metal 
halide lamp, a xenon lamp, or the like. The spatial coherence controller 
42 can comprise, e.g., a plate defining a slit aperture 42a. The plate 
5 desirably includes a mechanism (such as a screw mechanism, not 

shown) for adjusting the width of the slit 42a. Another slit (not shown 
in FIG. 15 but see FIG. 13) is preferably situated in the path of the signal 
light S for blocking all orders of diffracted signal light except for zeroth 
order light, thereby allowing only zeroth order light to pass to the sensor 

10 51 . The diffraction grating 50 can be any of various gratings, prisms, 

and the like useful for splitting different wavelengths of light (a grating is 
preferred). The sensor 51 is desirably a linear sensor comprising multiple 
light-receiving units that are one-dimensionally separated from each other 
(e.g. an optical diode type line sensor such as a "512 device"). The 

1 5 different wavelengths of signal light S dispersed by the grating 50 are 
received simultaneously by the sensor 51 . 

The surface 47 of the wafer 46 can be, e.g., an insulating layer 
formed on a semiconductor device pattern on the wafer 46. The source 
41 of probe light P is adjusted to a suitable spatial coherence length by 

20 adjusting the slit width in the spatial coherence unit 42. (The spatial 
coherence length is a distance, on the device pattern, in which the 
irradiated probe light has coherency. The spatial coherence length can 
be varied by adjusting the beam diameter of the probe light. The probe 
light P passes through the lenses 43a, 43b and the beamsplitter 44, is 

25 coilimated by the collimator lens 45, and is illuminated onto the surface 
47 of the wafer 46. Signal light S reflected from the surface 47 carries 
information regarding the surface 47. The signal light S passes through 
the lenses 48a, 48b, 48c. Zeroth order signal light S propagates to the 
grating 50 that disperses the zeroth order light according to wavelength. 

30 The wavelength-dispersed signal light is detected by the sensor 51 . 
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The relationship between the semiconductor device pattern on the 
wafer 46 and the coherence length is as follows. From an optical 
perspective, the semiconductor device pattern typically comprises a large 
number of extremely fine features that are two-dimensionally distributed 
5 in each of one or more laminated thin films on the wafer surface. Each 
thin film typically has a different pattern. The probe light P 
simultaneously illuminates multiple features of the pattern on the wafer 
surface 47 as shown schematically in FIG. 16. In FIG. 16, by way of 
example, the probe light P from the source 41 illuminates four features F 

10 of the pattern. The components of reflected signal light from the 

features F are superposed in a complex manner in the signal light S. For 
example, FIG. 17 shows a magnified vertical section of a portion of the 
pattern structure on the wafer surface 46, illustrating the principle by 
which the reflected signal light from individual features is superposed. In 

1 5 order to measure the thickness of a portion of the surface having a 

pattern structure, probe light P (desirably having a spot diameter larger 
than the size of individual features of the pattern) is illuminated onto the 
wafer surface 46. The influence of the signal light reflected from each 
feature illuminated within the spot is taken into consideration. For 

20 example, whether light 56 reflected from features having the same 
height mutually interfere or not interfere is considered. 

In FIG. 17, signal light produced by reflection from multiple 
features is simplified to provide a model for discussion. Item 46 is the 
wafer. Items 53 are individual features of the pattern (e.g., electrode 

25 features). The features 53 are covered with an insulating layer 54. In 
FIG. 17, components 56 of signal light are shown reflecting from the 
features 53 and from the surface of the insulating film 54. Each 
component 56 of the signal light is an amplitude component of 
interference light. The number of amplitude components is indicated by 

30 the number of upwardly directed arrows in the figure. The reference 
number 55 refers to the pitch of the pattern. 



DLS:lam 4641-59261 



Express Mail No. EL75401 981 2US 

- 51 - 



FIG. 17 shows three components of reflected signal light 56 (each 
component being reflected from a different surface). The reference 
number 58 refers to the spatial coherence length of the reflected signal 
light. The reference number 57 refers to a region above the pattern 
5 within which the components of reflected signal light interfere with one 
another. 

Whether an interference occurs in the region 57 is important from 
a practical and theoretical perspective. The occurrence of interference 
depends upon the fineness of the pattern and the spatial coherence 

1 0 length of the spot of incident probe light. If the pitch of the pattern 

inside the spot is shorter than the coherence length, then an interference 
is generated. Otherwise, no interference is generated. 

In general, the spatial coherence length of an optical system is 
infinite at an ideal point source of light. In all other optical systems, the 

15 spatial coherence length is finite. In this embodiment, the spatial 
coherence length is desirably made variable by changing the optical 
system (i.e., changing the arrangement of one or more optical members 
and the like). Thus, theoretically, the spatial coherence length can be 
varied if desired according to the fineness of the pattern. 

20 In this embodiment, the spatial coherence length can be 

controllably varied by changing the perspective angle of the source of 
probe light. According to Zelnike's theorem, a relationship exists among 
the spatial coherence length, the wavelength \, and the numerical 
aperture NA, as follows: 

25 

spatial coherence length ~ 0.61 ^/NA 

Hence, changing the numerical aperture NA of the optical system can 
result in a change in the spatial coherence length. More specifically, by 
30 changing the slit width of the probe-light optical system immediately 
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downstream of the light source in a manner as described above, the 
spatial coherence length can be controlled in a desired manner. 

Returning to FIG. 15, in performing measurements of a wafer 
surface containing a device pattern, it is desirable that the spatial 
5 coherence length be larger than the pitch of the pattern. For most 

patterns currently imprinted onto wafers, the pitch is usually less than 1 

^m. It is relatively easy to set the spatial coherence length of the probe 
light source sufficiently larger than 1 ^m. The intensity profile of signal 
light measured under such a condition agrees well with a signal-light 

10 intensity profile theoretically calculated under an assumed condition in 
which interference occurs in the signal light simultaneously reflected 
from multiple features on the surface of the film. However, if the pattern 
is coarser than usual and has a feature size of, e.g., about 10 ^m, then 
it is more difficult to set the spatial coherence length sufficiently larger 

1 5 than the pitch. When measuring the film thickness of such a coarse 
pattern, the signal light intensity is theoretically calculated on the 
assumption that that interference does not occur. Hence, in conducting 
measurements according to this embodiment, the coherence length of 
the probe light source is desirably smaller than the pitch of the pattern on 

20 the wafer surface. Under such conditions, the actually measured 

intensity profile of signal light agrees well with a theoretically calculated 
intensity profile of reflected signal light in which interference occurs. 

The intensity of reflected signal light can be calculated by 
executing an optical calculation (i.e., a calculation of theoretical spectral 

25 reflectance) of the superposition of light waves. The optical calculation 
is executed when interference does occur in the film-surface direction 
(i.e., lateral direction parallel to the film surface) and when interference 
does not occur in the film-surface direction with respect to, e.g., signal 
light reflected from multiple features having the same height. According 

30 to conventional practice, it is generally not easy to directly calculate the 
thickness of the film to be measured. Hence, according to this 
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embodiment, the theoretical intensity profile of signal light reflected from 
a patterned surface having a prescribed film thickness, feature size, and 
feature density is calculated. The calculated theoretical intensity profile 
is then compared with an actually measured intensity profile of reflected 
5 signal light to calculate the film thickness. 

More specifically, the theoretical intensity profile of reflected light 
from each of multiple films is calculated. The films are pre-selected 
based on an inter-layer distance that is determined by the measurement 
precision. The calculations are made on the assumption that interference 

10 occurs in the signal light reflected from multiple features simultaneously 
illuminated by the probe light or on the assumption that interference does 
not occur. The calculated intensity profile corresponds to the fineness of 
the pattern (pitch of the pattern). The intensity profile of reflected signal 
light can be calculated for each type of pattern, and data obtained from 

15 such calculations can be stored in a memory for later recall. 

It is difficult to theoretically calculate the intensity profile of signal 
light under a condition of partial interference (a condition that is 
intermediate a state in which an interference is occurring in the signal 
light and a state in which an interference is not occurring). One reason 

20 for such difficulty is that, with an optical system adjusted to have a 
coherence length sufficient to cause partial interference, the degree of 
partial interference changes in response to even slight changes in the 
fineness of the pattern. (The degree of partial interference is equivalent 
to a ratio of a portion of the pattern where interference occurs to a 

25 portion where interference is not occurring.) As a result, the theoretical 
calculations would require that the intensity profile of signal light be 
separately calculated under each of multiple different partial interference 
conditions for each pattern having a slightly different fineness. Such 
calculations are difficult to perform and utilize from a practical 

30 standpoint. 
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Therefore, according to this embodiment, it is very effective to 
vary the spatial coherence length of the probe light in accordance with 
the fineness of a pattern so as to better distinguish a state of 
interference from a state of non-interference in the signal light. 
5 The measuring method described above can be applied to a 

polishing apparatus comprising a device for detecting film thickness and 
for detecting a process endpoint, such as the apparatus shown in FIG. 2. 
Details of construction and operation of the FIG. -2 apparatus have 
already been discussed and such discussion is not repeated here. 

10 Further with respect to the FIG. -2 apparatus, the signal processor 

17 stores and recalls data on various intensity profiles of signal light, and 
compares a measured signal-light intensity profile with corresponding 
recalled data. The signal processor 1 7 also calculates a cross-correlation 
coefficient between the measured signal-light intensity profile and each 

15 of the recalled signal-light intensity profiles associated with the recalled 
patterns and the thickness of the recalled patterns. The signal processor 
1 7 then selects the recalled pattern having the largest cross-correlation 
coefficient, and regards the selected pattern and the thickness thereof as 
corresponding to the actual measured pattern and the thickness thereof, 

20 respectively. 

In performing a comparison of the measured pattern with a stored 
pattern based solely on cross-correlation coefficients, a determination of 
the film thickness of a measured pattern can be difficult depending on 
the type of the measured pattern. Hence, in addition to comparing cross- 

25 correlation coefficients associated with the signal-light intensity profiles, 
it is desirable to calculate cross-correlation coefficients of Fourier- 
transformed theoretically calculated intensity profiles (of stored data) 
with a Fourier- transformed intensity profile of an actual measured 
intensity profile so as to obtain multiple cross-correlation coefficients. It 

30 is also desirable to compare the multiple cross-correlation coefficients 
with each other or to compare a Fourier component of the measured 
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signal-light intensity profile with one or more Fourier components of the 
stored signal-light intensity profile. It is even more desirable to use both 
of these comparison procedures. 

In general, a device field on a wafer includes multiple patterns 
5 having various degrees of fineness and exhibiting various intensity 
profiles of signal light. In measuring film thickness, it is important to 
ascertain which of the multiple patterns is actually being illuminated with 
the probe light. As discussed above, all possible spectral reflectance 
profiles corresponding to the various blocks are pre-stored. The 

10 particular pattern being irradiated with probe light can be identified by 

comparing the measured spectral reflectance with corresponding spectral 
reflectance profiles that were pre-calculated and stored. In any event, 
the measurements are repeated as required as polishing is ongoing. 
Polishing is terminated when the target film thickness is achieved. 

1 5 The thickness of the layer being polished can be measured in situ 

as polishing is ongoing, or can be measured off-line by removing the 
wafer from the polishing apparatus. 

Whereas the invention has been described in connection with 
multiple embodiments and examples, it will be understood that the 

20 invention is not limited to those embodiments and examples. On the 
contrary, the invention is intended to encompass all modifications, 
alternatives, and equivalents as may be included within the spirit and 
scope of the invention as defined by the appended claims. 



